Modeling prosody for language identification on read and spontaneous speech
نویسندگان
چکیده
This paper deals with an approach to Automatic Language Identification using only prosodic modeling. The actual approach for language identification focuses mainly on phonotactics because it gives the best results. We propose here to evaluate the relevance of prosodic information for language identification with read studio recording (previous experiment [1]) and spontaneous telephone speech. For read speech, experiments were performed on the five languages of the MULTEXT database [2]. On the MULTEXT corpus, our prosodic system achieved an identification rate of 79 % on the five languages discrimination task. For spontaneous speech, experiments are made on the ten languages of the OGI Multilingual telephone speech corpus [3]. On the OGI MLTS corpus, the results are given for languages pair discrimination tasks, and are compared with results from [4]. As a conclusion, if our prosodic system achieves good performance on read speech, it might not take into account the complexity of spontaneous speech prosody.
منابع مشابه
Common and Language Dependent Phonetic Differences Between Read and Spontaneous Speech in Russian, Finnish and Dutch
This preliminary study aims to reveal both common and language-specific phonetic differences between read and spontaneous speech in three typologically unrelated languages – Russian, Finnish, and Dutch. These languages differ in prosody, sound systems, speech styles, and means for conveying intonational meaning. Spontaneous speech was recorded from 5 to 8 speakers in each language. Transliterat...
متن کاملProsody for Mandarin speech recognition: a comparative study of read and spontaneous speech
In this paper, we present a comparative study between spontaneous speech and read Mandarin speech in the context of automatic speech recognition. We focus on analysis and modeling of prosodic features, based on a unique speech corpus that contains similar amounts of read and spontaneous speech data from the same group of speakers. Statistical analysis is carried out on tone contours and duratio...
متن کاملAnnotation Conventions and Corpus Design in the Investigation of Spontaneous Speech Prosody in Taiwanese
Understanding how intonational phrasing and focal prominence interact with lexically specified tone patterns is one of several problems in the investigation of speech processing in Chinese languages that cannot be addressed fully with read speech alone. This paper explores such problems for Taiwanese, one of the major languages in the southern Min dialect group. It outlines what is known about ...
متن کاملChinese Prosody and Prosodic Labeling of Spontaneous Speech
In this paper some prosodic research on read and spontaneous speech is introduced first, then the difference between read and spontaneous speech will be depicted, and finally the prosodic labeling system C-ToBI will be described.
متن کاملEstimating speaker-specific intonation patterns using the linear alignment model
Modeling speaker-specific intonation is important in several areas, including speaker identification, verification, and imitation using text-to-speech synthesis. However the choice of the intonation model and the estimation of its parameters from spontaneous speech remains a challenge. We propose a way to estimate speaker-specific intonation parameters for a particular superpositional model, th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003